Picture for Yang Xiao

Yang Xiao

Benchmarking Multimodal LLMs on Code Generation for Complex Interactive Webpages

Add code
May 29, 2026
Viaarxiv icon

Learning When to Think While Listening in Large Audio-Language Models

Add code
May 26, 2026
Viaarxiv icon

Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory

Add code
May 26, 2026
Viaarxiv icon

Rethinking Continual Learning for Speech and Audio: A Representation-Centric Taxonomy and Open Problems

Add code
May 24, 2026
Viaarxiv icon

Knowledge Visualization: A Benchmark and Method for Knowledge-Intensive Text-to-Image Generation

Add code
Apr 24, 2026
Viaarxiv icon

RSGMamba: Reliability-Aware Self-Gated State Space Model for Multimodal Semantic Segmentation

Add code
Apr 14, 2026
Viaarxiv icon

SepSeq: A Training-Free Framework for Long Numerical Sequence Processing in LLMs

Add code
Apr 09, 2026
Viaarxiv icon

Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation

Add code
Mar 25, 2026
Viaarxiv icon

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio

Add code
Mar 05, 2026
Viaarxiv icon

Your Language Model Secretly Contains Personality Subnetworks

Add code
Feb 06, 2026
Viaarxiv icon